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Abstract 

Wc consider a branching model for a population of dividing cells infected by para- 
sites. Each cell receives parasites by inheritance from its mother cell and independent 
contamination from outside the cell population. Parasites multiply randomly inside 
the cell and are shared randomly between the two daughter cells when the cell di- 
vides. The law of the number of parasites which contaminate a given cell depends 
only on whether the cell is already infected or not. We determine first the asymp- 
totic behavior of branching processes in random environment with state dependent 
immigration, which gives the convergence in distribution of the number of parasites 
in a cell line. We then derive a law of large numbers for the asymptotic proportions 
of cells with a given number of parasites. The main tools are branching processes in 
random environment and laws of large numbers for Markov tree. 
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1 Introduction 



We consider the following model for cell division with parasite infection and state 
dependent contamination. The cell population starts from one single cell and divides in 
discrete time. At each generation, 

(i) the parasites multiply randomly inside the cells, 

(ii) each cell is contaminated by a random number of parasites which come from outside 
the cell population, 

(iii) each cell divides into two daughter cells and the parasites are shared randomly into 
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the two daughter cells. 

It is convenient to distinguish a first daughter cell called and a second one called 
1. We denote by T = U ne ^{0, l} n the binary genealogical tree of the cell population, by 
G n the set of cells in generation n and by Z\ the number of parasites of cell i £ T. We 
write then iO and il the two daughter cells of the cell i G T. 

First, we describe by a branching process the random multiplication and sharing 
of parasites in the cell, i.e. this branching process combines (i) and (iii). Second, we 
describe the random contamination (ii) by immigration. Finally, we combine both in an 
i.i.d. manner to fully describe the model. 

I Parasite infection and cell division For every cell, we choose randomly a mechanism 
for multiplication of the parasites inside and sharing of their offspring when the cell 
divides. This mechanism is independent and identically distributed for every cell. Its 
distribution is specified by a random couple probability generating function (p.g.f) f. 
This means that f is a.s. the p.g.f of a pair of random variables taking values in N. 

More precisely let (/i)ieT be a sequence of i.i.d. couple p.g.f distributed as f . For each 
cell 1, /i gives the reproduction law and sharing of the offspring of its parasites in the 
following way. For every i G T, let (X^ (i), X^ (i))feeN De a sequence of r.v. such that 
conditionally on fj = g, (.xl (i), (i))keN are i-i-d. with common couple p.g.f g: 

Vi G T, Mk G N, Vs, t G [0, 1], E(s x * {l) | f, = g) = g(s, t). 

Then, in each generation, each parasite k of the cell i gives birth to xj?\i) + xjp(i) 

children, X^\i) of which go into the first daughter cell and xj^\i) of which go into the 
second one, when the cell divides. This is a more general model for parasite infection and 
cell division than the model studied in [8], where there was no random environment (f 
was deterministic) and the the total number of parasites was a Galton Watson process. 
See |15| for the original model in continuous time. 

Our model includes also the two following natural models, with random binomial 
repartition of parasites. Let Z be a random variable in N and (P^^j be i.i.d. random 
variable in [0,1]. In each generation, every parasite multiplies independently with the 
same reproduction law Z. Thus parasites follow a Galton Watson process. Moreover 
P 1 gives the mean fraction of parasites of the cell i which goes into the first daughter 
cell when the cell divides. More precisely, conditionally on P 1 = p, every parasite of the 
mother cell i chooses independently the first daughter cell with probability p (and the 
second one with probability 1 — p). 

It contains also the following model. Every parasite gives birth independently to a 
random cluster of parasites of size Z and conditionally on P 1 = p, every cluster of 
parasite goes independently into the first cell with probability p (and into the second 
one with probability 1 — p) . 

We want to take into account asymmetric repartition of parasites and do not make 
any assumption about f. Indeed unequal sharing have been observed when the cell 
divides, see e.g. experiments of M. de Paepe, G. Paul and F. Taddei at TaMaRa's 
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Laboratory (Hpital Necker, Paris) who have infected the bacteria E. Coli with a lysogen 
bacteriophage M13 [20]. In Section 16, 1\ we consider this model where a cell receive 
parasites only by inheritance from its mother cell. We determine when the number of 
infected cells becomes negligible compared to the number of cells when the generation 
tends to infinity. 

II State dependent contamination In each generation, each cell may be contaminated 
by a random number of parasites which also multiply randomly and are shared randomly 
between the two daughter cells. This contamination depends only on whether the cell 
already contains parasites or not. 

More formally, if a cell 1 contains x parasites, the contamination brings parasites 
to the first daughter cell of 1 and yJ 1 ^ to the second one, where 

Vx > 1, Yi-A I Y« , Y := Y Q {0) = ^ • 

Moreover we assume that contamination satisfies 

< P(Y = 0) < 1, < P(Y = 0), (1) 

which means that each non-infected cell may be contaminated with a positive probability 
but the cells are not contaminated with probability one. 

This model contains the case when the contamination is independent of the number 
of parasites in the cell (Yq and Y\ are identically distributed). It also takes into account 
the case when only non infected cells can be contaminated (Yi = a.s.) and the case 
when infected cells are 'weaker' and parasites contaminate them easier (Yi > Yq a.s.). 
For biological and technical reasons, we dot make Y x depend on i > 1. But the results 
given here could be generalized to the case when the contamination depends on the 
number of parasites x inside the cells soon as x is less than some fixed constant. 

III Cell division with parasite infection and contamination We describe now the 
whole model. We start with a single cell with k parasites and denote by P& the 
associated probability. Unless otherwise specified, we assume k = 0. 

For every cell i £ T, conditionally on Z\ = x and f, = g, the numbers of parasites 
(Zio, Z\i) of its two daughter cells is distributed as 

trfw,4 1) «)+o; (p) w 1 yi 1, w), 

k=l 

where 

(i) (x[ (i),xi (i))fc>i is an i.i.d. sequence with common couple p.g.f g. 

(ii) (Yi 0) (i), Yi 1} (i)) is independent of (X^ (i),X^ (x))^. 
Moreover, ((X^ (i), X^\i)) k > u (yj 0) (i),Y x (1) (i))^) are i.i.d. for i G T. 
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Figure 1. Cell division with multiplication of parasites, random sharing and contamina- 
tion. Each parasite gives birth to a random number of light parasites and dark parasites. 
Light parasites go into the first daughter cell, dark parasites go into the second daughter 
cell and square parasites contaminate the cells from outside the cell population. But 
light/ dark/ square parasites then behave in the same way. 



generation generation 1 generation 2 




This model is a Markov chain indexed by a tree. This subject has been studied in the 
literature (see e.g. [HI El [9]) in the symmetric independent case. That is, V(i, k) £ T x N, 

P((Z i0 , Z a ) = (ko, h) \Z l = k) = P(Z i0 = k \Zi = k)F(Z iQ = k l \Z i = k). 

But this identity does not hold here since we are interested in unequal sharing of 
parasites. Guyon [12] proves limit theorems for a Markov chain indexed by a binary 
tree where asymmetry and dependence are allowed. His theorem is the key argument 
to prove the convergence of asymptotic proportions of cells with a given number of 
parasites here. Indeed, contamination ensures that the process which counts the number 
of parasites along the random walk on the binary tree of the cell population is ergodic 
and non trivial (see Section [5]). This is the fundamental assumption to use Guyon's law 
of large numbers. Let us then introduce more precisely this process which gives the 
number of parasites in a random cell line. 

Let (cij)j 6 N be an i.i.d. sequence independent of {Z\)- l& j such that 

P(oi = 0) = P(ai = 1) = 1/2. (2) 

Denote by (resp f^') the random p.g.f which gives the law of the size of the offspring 
of a parasite which goes in the first daughter cell (resp. in the second daughter cell): 

/(%):= f( S ,l) a.s., /«(i):=f(l,i) a.s., (s,t€[0,l]). 
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Let / be the mixed generating function of f(°' and f^\ i.e. 

p(/ e dg) = P(/(°) € dg) + P(/W £ dg) ^ 

Then (Z n ) neN = 

(Z(ai,a,2,..a n ))n£N is a Branching Process in Random Environment with 
immigration depending on the state is zero or not: the reproduction law is given by its 
p.g.f /, the immigration law in zero is distributed as Yq, and the immigration law in k > 1 
as Y\. Thus, we first need to prove asymptotic results for this process. 



2 Main results 

Galton Watson processes with immigration are well known (see e.g. [TJ HE]). If 
the process is subcritical and the expectation of the logarithm of the immigration is 
finite, then it converges in distribution to a finite random variable. Otherwise it tends 
to infinity in probability. Key [13j has obtained the analogue result for Branching 
Processes in Random Environment with Immigration (IBPRE), in the subcritical case, 
with finite expectation of the logarithm. Actually he states results for multitype IBPRE, 
which have been complemented by Roitershtein [T7] who obtained a strong law of large 
numbers and a central limit theorem for the partial sum. 

In Section^ we give the asymptotic behavior of IBPRE in the different cases, which 
means that we also consider the critical or supercritical case and the case when the 
expectation of the logarithm of the immigration is infinite. To get these results, we 
use some general statements on Markov processes (Section I3.2p . classical arguments for 
Galton Watson process with immigration (see [IS], which was inspired from pQ) and the 
tail of the time when IBPRE returns to in the subcritical case, which is proved in |13j . 

We can then state results about branching processes in random environment (Z n ) nG pj 
with immigration depending on the state is zero or not (Section [5]) using coupling 
arguments and Section 13.21 This process gives the number of parasites along a random 
cell line. Recalling that immigration in state zero is distributed as Yq and immigration 
in state k > 1 is distributed as Y±, we prove the following expected result. 



Theorem, (i) If E(log(/'(l))) < and max(E(log+(yi)) : % = 0, 1) < oo, then there 
exists a finite r.v. Zqq such that for every k £ N, Z n starting from k converges in 
distribution to as n — > oo. 

(ii) If E(log(/'(l))) > or max(E(log + (l^)) : i = 0, 1) = oo, then Z n converges in 
probability to oo as n — > oo. 

With additional assumptions, we provide in Section [5] an estimate of the rate of 
convergence of (Z n ) n£ ^ depending on the initial state. 

Then, in Section^ we prove asymptotic results on the population of cells in generation 
nasn-> oo. 

First, we consider the case when there is no contamination: Yq = Y\ = a.s. 
We determine when the organism recovers, meaning that the number of infected cells 
becomes negligible compared to the total number of cells. As stated in Proposition [21 the 
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recovery occurs a.s. iff E(log(/'(l))) < 0. Thus, we generalize results of Section 3 in [8] to 
random environment. Again, for any reproduction rate of parasites, we can find a neces- 
sary and sufficient condition on sharing of their offspring so that the organism recovers a.s. 

As explained in introduction, a natural example is the random binomial repartition 
of parasites. If the reproduction of parasites is given by the r.v. Z and the random 
parameter of the binomial repartition is given P G [0,1], the a.s. recovery criterion 
becomes 

log(E(Z)) < E(log(l/P)). 



Second we take into account the contamination by parasites from outside the cell 
population with assumptions ([T]). We focus on proportions of cells in generation n with 
a given number of parasites: 

Using [12] and the theorem above, we prove the following law of large numbers. 

Theorem. // E(log(/'(l))) < and max(E(log + (yj)) : i = 0, 1) < oo, then for every 
k G N, Fk(n) converges in probability to a deterministic number fj. as n — > oo, such that 
fo > and YlkLofk = 1 - 

Otherwise, for every k G N, Fk(n) converges in probability to as n —* oo. 

Finally, in Section [TJ we give the asymptotic behavior of the total number of parasites 
in generation n in the case when the growth of parasites follows a Galton Watson process 
and the contamination does not depend on the state of cell. 



3 Preliminaries 

We recall first some results about Branching Processes in Random Environment 
(BPRE) and then about Markov chains, which will be both useful to study BPRE with 
immigration (Z n ) n ^. Recall that we denote by k the initial number of parasites and by 
Pfc the probability associated with. 



3.1 Branching Processes in Random Environment (BPRE) 

We consider here a BPRE (Z n ) n6 N specified by a sequence of i.i.d. generating func- 
tions (/n)neN distributed as / [2J, [3l [19] . More precisely, conditionally on the environment 
(/n)neN) particles at generation n reproduce independently of each other and their off- 
spring has generating function f n . Then Z n is the number of particles at generation n 
and Z n+ \ is the sum of Z n independent random variables with generating function f n . 
That is, for every n G N, 

E(s z "+i\Z ,...,Z n ; /(,,...,/„) =fn(s) Z " (0<S<1). 

Thus, denoting by F n := fo o ■ ■ ■ o / n _i, we have for every fceN, 

E k (s Zn+1 I fo, fn) = n* Zn+1 \Z = k, f , /„) = F n {s) k (0 < 8 < 1). 
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When the environments are deterministic (i.e. / is a deterministic generating 
function), this process is the Galton Watson process with reproduction law N, where / 
is the generating function of N . 



The process (Z n ) n ^ is called subcritical, critical or supercritical if 

E(log(/'(l))) 

is negative, zero or positive respectively. This process becomes extinct a.s.: 

F(3n G N : Z n = 0) = 1 
iff it is subcritical or critical [2] (see |10j for finer results). 

In the critical case, we make the following integrability assumption: 

< E(log(^(l)) 2 ) < oo, E([l + log(/5(l))]/£(l)/2/£(l)) < oo, 
so that there exist < c\ < C2 < oo such that for every n G N (see [15] ) 

ci/Vra < HZ n > 0) < c 2 /Vn. (3) 
See [7] for more general result in the critical case. 



3.2 Markov chains 

We consider now a Markov chain (Z n ) ng ^ taking values in N and introduce the first 
time To when (Z n ) ne ^ visits after time 0: 

T := inf{< > : Z t = 0}. 

Denote by 

u n := ¥ (Z n = 0), Uoo := 1/E (T ) (l/oo = 0). 

By now, we assume < Po(^i = 0) < 1 and we give the asymptotic behavior of (Z n ) n€ fq. 
The first part of (i) is the classical ergodic property for an aperiodic positive recur- 
rent Markov chain and we provide an estimate of the speed of convergence depending 
on the initial state. Then (ii) gives the null recurrent case, which is also a classical result. 



Lemma 1. (i) If for every k £ N, E^(To) < oo, then Z n starting from k converges in 
distribution to a finite random variable Z^, which does not depend on k and verifies 

P(Zoo = 0) > 0. 

Moreover there exists A > such that for all n, k G N, 

£|P fc (Z n = 0-P(Zoo = 0l 

<A[ sup {\ Ul - Uoo \} + Eo(TolT 0> n/d+MToH>n/±)]- (4) 

n/2<l<n 

(ii) If Kq(Tq) = oo and for every I G N, P;(To < oo) > 0, then for every k G N, 
Z n — > oo in P '^-probability as n — > oo. 
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Proof of (i). First, note that by the Markov property, for every n £ N, 

\F k (Z n = 0) - Uoo \ 

n 

= | ^ P fc (T = jWZn-j = 0) - «oo| 



3=1 
n 



< ^Pfc(T = j)\Un-j - Uool + 1i 00 P fc (To > n). 
3=1 



(5) 



On the event {To < n}, define R n as the last passage time of {Z n ) n& ^ by before time n: 

R n := sup{i < n : Zi = 0} . 

For all < i < n and I E N, by the Markov property, 

rt 

¥ k (Z n = I) = P fc (T > n, Z n = l)+J2 W k(T <n, R n = n-i, Z n = l) 

i=0 
n 

= P fc (T > n, Z n = I) + J> fc (Z„_, = 0)F (Zi =l,T > i). 



Define now 



i=0 



ai := Uoo^FoiZi = 1, T > i). 
i=0 



We then have 

<x> 

\F k {Z n = 1)- a, | < P fc (T > n, Z n = l) + Uoo = I, T > i) 

i=n+l 

n 

+ ^P(Z, = Z, T >i)\u OQ -F k (Z n _ i = Q)\. 

i=0 

Summing over I leads to 

Y,\^k(Z n = I) ~ a t \ < F k {T >n) + n oo E (r iT >n+i) 



+ W(T > i) | Uoo - P fc (Z n _i = 0) 
j=0 

Moreover using ([5]), we have for all < uq < n, 

n 

^P(T > i)\ Uoo -F k (Z n „i = 0)| 



(6) 



i=0 

n 

< ^P (T >i) 

i=0 



< ^P (T >i)^P fc (T =i)| 

3=1 



^P fc (T = j)\u n -i-j - I + nooP fc (T >n-i) 

n 

J2n(T >i)F k (T >n-i){7) 



3=1 
n—i 



i=0 



i=0 



Finally, denoting by M := sup nGN {|it n - «oo|}, 



J>o(T >i)^P fc (T =j)|n n . 



•i— j ^oo 



i=o i=i 



< sup {\ Ul - Uoo \}J2F (T > i)J2 F k( T o = j)l 

n <l<n i=Q j=1 



n-i-j>n 



n—i 



+M^P (T > i) ^P fc (T = j)l n _ i _ i<no 

i=0 j=l 



n n—i 



< sup {h- Uoo |}^P (To>i)^P fc (T =j) 



n—no 



+M Wo(To >i)Pfc(T >n-no-»). (8) 

i=0 

Combining (jSJ), ([7]) and ([5]) and using that 

n 

^P (T > i)¥ k (T >n-i)< E (T i To>n/2 ) +E fc (T i To>n/2 ), 

i=0 
n—no 

Y Po(T > i)P fc (T > n - n Q - i) < E (T ]l To>(n _ no)/2 ) 
i=0 

we get, for all < no < n, 



J2\¥ k (Z n = l)-ai\ < ¥ k (T >n) + Uoo E (T t To>n+l )+ sup {\m - Uoo \}E (T ) 

Z£N n <l<n 

+[ Uoo + M][E (r i ro>(n _ no)/2 ) + E fc (r % >(n _ no)/a )]. (9) 



As Po(Zi = 0) > 0, by the renewal theorem [TT], u n — u^. Adding that Efc(To) < oo 
and Eo(To) < oo ensures that 

Y / \V k (Zn = l)-a l \ n =^0, 

which proves that Z n starting from k converges in distribution to a r.v. which does 
not depend on k. 

The inequality of (i) is obtained by letting no = ra/2 in @. □ 



Proof of (ii). If Eo(To) = oo, then by the renewal theorem again [llj, 

U n ► 0. 

So 

D n = inf{/£ — n : k > n, Z k = 0} "-^5? oo, in probability. 



Assume that there exist I 6 N, e > and an increasing sequence of integers (u n ) ne ^ such 
that 

W k (Z Un =l)>e. 
As P/(To < oo) > by hypothesis, there exists N > such that 

P,(T = N) > 0. 

Thus, by the Markov property, 

P fc (Z nn+i ^ = 0) > P fc (Z Mn = /)Pi(r = N) > eFi(T = N). 
Then, for all n e N, 

Pfc(^n„<iV)>eP/(To = iV)>0, 

which is in contradiction with the fact that D n — ► oo in P& as n — > oo. Then, Pfc(^ n = 
Z) — > as n — > cxd. □ 



4 Branching processes in random environment with immi- 
gration (IBPRE) 

We consider here a BPRE (Z„)„ 6 n whose reproduction law is given by the random 
p.g.f / and we add at each generation n + 1 a random number of immigrants Y n indepen- 
dent and identically distributed as a r.v Y such that 

P(y = 0) > 0. 

More precisely, for every n £ N, 

Yn + ^Xt, (10) 

i=l 

where (Xj)j G pj, Y n and Z n are independent and conditionally on f n = g, the (A"j)j 6 pj are 
i.i.d. with common probability generating function g. 

Note that if the contamination does not dependent on the fact that this cell is already 
infected or not (i.e. Yq and Y\ are identically distributed), then the number of parasites 
in a random cell line defined in Introduction is a IBPRE whose reproduction law given 

by / and immigration by Y = Yq = Y\. 

We give now the asymptotic behavior of this process. These results are classical for 
the Galton Watson process with immigration [TJ [TH] . We follow the same method in the 
case of random environment for the subcritical and supercritical cases. We give in (ii) 
the tail of the time 

T = inf{n > : Z n = 0} 

when the process returns to in the subcritical case, which is proved in [T3] and we use 
Section 13.21 for the critical case. 
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Proposition 1. (i) If E( log(/'(l))) < and E(log + (Y)) < oo, then Z n converges in 
distribution to a finite random variable as n —* oo and \im n ^ 00 ¥(Z n = 0) > 0. 
Otherwise Z n — > oo as n — > oo. 

(«i) 7/E( log(/'(l))) < and there exists q > suc/i i/tai K(Y q ) < oo, then there exist 
c, d > such that for every n 6 N, 



C (T > n) < ce' 



fm) Assume E(/'(l) 1 ) < 1 and E(log + (Y)) < oo, £/ien £/iere exists a finite r.v. W 
such that 

liy-JfKiT'Zn^w, »P. 



Note also that by the Borel-Cantelli lemma, if E(log + (Y"i)) = oo, then for every c > 1, 

limsupc~ n Z n = oo a.s. 

n^oo 

since Z n > Y n a.s. Moreover the proof of Section provides an other approach to prove 
that (Z n ) n& fq tends to oo if E(log + (Y)) = oo. 

Proof of (i) and (ii) in the subcritical case: E(log(/'(l))) < 0. The subcritical case with 
assumption E(log + (Y)) < oo is handled in [13]: First part of (i) is Theorem 3.3 and (ii) 
is a consequence of Theorem 4.2 of |13| . 

We focus now on the case E(log + (Y)) = oo and prove that Z n converges in probability 
to oo. The proof is close to the Galton Watson case (see pQ or |18]). First, by Borel- 
Cantelli lemma, 

limsuplog + (Yfc)/fc = oo a.s. 

k^oo 

Then, for every c G (0, 1), 

lim sup c k Yk = oo a.s. (11) 

fc^oo 

Note that 

n-l 
k=0 

where Zk n is the number of descendants in generation n of immigrants in generation 
n — k. Thus, denoting by Yk, n the number of immigrants in generation n — k and Xi(k, n) 
the number of descendants in generation n of immigrant i in generation n — k, we have 

n— 1 ^fc,™ 

Z n = ^2^2Xj(k,n). 

k=0 i=l 

This sum increases stochastically as n tends to infinity and converges in distribution to 

oo Y k 

k=0 i=l 
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where conditionally on (/j : i £ N), {Xi(k) : i £ N, k E N) are independent and the 
probability generating function of X- L (k) is equal to /^-l o ... o f Q . Roughly speaking, 
Xi(k) is the contribution of immigrant i which arrives k generations before 'final time' 
oo. The integer Xi(k) is the population in generation k of a BPRE without immigration 
starting from 1. 

Assume now that Z ra < oo with a positive probability. As (Xi(k) : k £ N, 1 < i < Yj~) 
are integers, then conditionally on < oo, only a finite number of them are positive. 
Thus, by Borel-Cantelli lemma, conditionally on (Z^ < oo, Yj% : k £ N, fi'.iE N), 

oo 

^Y fc P(Xi(£;) > 0) < oo a.s. 

k=0 

Moreover, by convexity, for all g p.g.f and s 6 [0, 1], 

lz»W = gglzjW > gWz»g) = , 

1 — s 1 — s 1 — 

Then 1 — g(s) > (1 — (7(0)) (1 — s) and by induction, we have for every k £N, 

P(*i(fc) > | fi : * € N) = l-/fc_io...o/ (0) 

> n^a-z^o)) 

= exp(S , fc ), 

where Sfc := Si=o l°s(l ~~ fi(®))- Thus, conditionally on (Z^ < oo, : k € N, fi'.iE 
N), 

oo 

^jY fc exp(5 fc ) < oo a.s. 
fc=o 

Thus, on the event {Z^ < oo} which has a positive probability, we get 

oo 

Yfc exp(5*fc) < oo a.s. 

fc=o 

Moreover S" n is a random walk with negative drift E(log(l — /o(0))). So letting a < 
E(log(l — /o(l))), P(5' ra < an) decreases exponentially by classical large deviation results. 
Then by Borel-Cantelli lemma, S n is less than an for a finite number of n, and 

L := inf \S n — an] > — oo a.s. 
Using that for every k S N, Sk > ak + L a.s., we get 

oo 

exp(afc)Yfc < oo, 

fc=0 

with positive probability. This is in contradiction with (jlip . Then = oo a.s. and Z n 
converges in probability to oo as n — > oo. □ 
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Proof of (i) in the critical and supercritical case: E(log(/'(l))) > 0. First, we focus on 
the critical case. Recall that To = inf{i > : Z{ = 0} and consider (Z n ) n ^ the 
BPRE associated with (Z n ) n£ ^, that is the critical BPRE with reproduction law / and 
no immigration. Thanks to ([3]), there exists c\ > such that for ever n £ N, 

Pl(^n > 0) > ci/Vn. 

Adding that 

Pi (T > n) = P x (Z n > 0) > ¥ x (Z n > 0), 

ensures that 

Ei(T ) = oo. 

Then Eo(To) = oo since IBPRE (Z n ) nG ^ starting from 1 is stochastically larger than 
(Z n ) ng N starting from 0. Moreover Mk £ N, Pfc(To < oo) > 0, since Pfc(To < oo) = 1 and 
P(Y = 0) > 0. Then Lemma [U (ii) ensures that Z n — > oo in P as n —* oo. 

For the supercritical case, follow the proof in the critical case (or use the result with 
a coupling argument) to get that Z n — > oo in probability as n — > oo □ 



Proof of (Hi). We follow again [18] . If E(log + (l")) < oo, by Borel-Cantelli Lemma 
Then for every c > 1, 



limsuplog + (Yfc)//c = 0. 



c- k Y k < oo a.s. (12) 

fc=0 

Define 

Pn ■= [n^/Ki)]- 1 , 

and denote by T n the a- field generated by (Zi : < i < n), (Pi : < i < n) and 
(Yfc : k € N). Then using (TlOl) . we have 

E(P n+1 Z n+1 | = E(P n+1 [^X i + y n ] I 

= PnEif^l)- 1 Xi I T n ) + PnEf/'Cl)- 1 )^ 
i=l 

= P v E(f n {l)- 1 Z v K(X 1 | /„) | ^ n ) + p n E(/'(i)- 1 )y n 
= p n z n + p n E(/'(i)- 1 )y n . 

So P n Z n is a submartingale. Moreover 

n-l 

E(P„Z n |^ ) = Zo + ^EC/'Cl)- 1 )^ 1 ^. 

i=0 

By (fT2"j) . if E(/'(l) _1 ) < 1, P n Z n has bounded expectations and then converges a.s. to a 
finite r.v. □ 
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5 Ergodicity and convergence for a random cell line 



Recall that (Z n ) n ^ defined in Introduction is the number of parasites in a random 
cell line. The Markov chain (Z n ) nG ^ is a BPRE with state dependent immigration. The 
reproduction law is given by the p.g.f /, immigration in state is distributed as Yq 
and immigration in state k > 1 is distributed as Y\. More precisely, for every n G N, 
conditionally on Z n = x, 

X 

2 n+1 = yi n ) + ^lf) where 
i=i 

(i) (X^ )ieN an( i Yx are independent. 

(ii) Conditionally on f n = g, the (X- are i.i.d. with common probability generating 
function g, 

(iii) For all x > 1 and n £ N, Yj n) = Y x . 

We have the following results, which generalize those of the previous section to the 
case when immigration depends on whether the state is zero or not. 

Theorem 1. (i) 7/E(log(/'(l))) < and max(E(log + (Ti)) : i = 0, 1) < oo, then there 
exists a finite random variable such that for every k 6 N, Z n starting from k converges 
in distribution to Z^ as n — > oo. 

Moreover, if there exists q > such that max(E(Y^) : i = 0, 1) < cxd, then for every 
e > 0, i/iere exisi < r < 1 and C > suc/i £/ia£ /or all n £N and k E N, 

oo 

^ |p fc (z n = o - p(^oo = 01 < c ^ n - 

fii) // E( log(/'(l))) > or max(E(log + (Yj)) : % = 0, 1) = oo, Z ra converges in 
probability to infinity as n — > cxd. 

Note again that by Borel-Cantelli lemma, if E(log + (Yi)) = oo, then for every c > 1, 

limsupc _n Z n = oo a.s., 

since Z n > Y n a.s. 

The proof of (ii) in the critical or supercritical case (E(log(/'(l))) > 0) is directly 
derived from Proposition [T] and we focus now on the subcritical case: 

E(log(/'(l))) <0. 

Recall that To is the first time after when (Z n ) n ^ visits 0. Using IBPRE (see Section 
HJ, w e prove the following result in the subcritical case. 
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Lemma 2. If max(E(log + (Yj)) : i = 0, 1) < oo, then for every kN, Pfc(To < oo) = 1 and 

sup{P fc (Z n > 0} ^ 0. 

neN 

Moreover if there exists q > suc/i i/ia£ max(E(l^' 3 ) : i = 0, 1) < oo, then for every e > 0, 
i/iere exisi r > anc? C > suc/i i/iai /or aZZ n G N, k > 1: 

Po(r > n) < Cr n , F k (T > n) < Ck e r n . 

Proof. We couple (Z n ) ng pj with an IBPRE {Z n ) n ^ with reproduction law given by the 
random p.g.f / (such as (Z n ) ne ^) and immigration Y defined by 

Y :=max(Y , Y Xl Y), 
where Yq, Y±, and Y are independent and Y is defined by 

oo 

P(y = 0) = 1/2; Vn € N*, P(Y = n) = an" 1 " 6 , a := [2 ^ r 1-e ] -1 . 

8=1 

Thus immigration Y for Z n is stochastically larger than immigration for Z n (whereas 
reproduction law is the same), so that coupling gives 

Vn € N, Z n < Z n a.s. 

Moreover, Z n is still subcritical. Recalling that min(P(Yi = 0) : i = 0, 1) > 0, P(Y = 
0) = 1/2, and that the expectation of the logarithm of every r.v. is finite, we have 

E(log+(Y)) < oo, P(Y = 0) > 0. 

Then Proposition [T] (i) ensures that Z n converges in distribution to a finite random 
variable, so that 

sup{P fc (Z n > I)} < aup{P fc (Z n > /)} ^ 0. 

Proposition [1] (i) ensures also that for every k E N, lim n _ i . 0O Pfc(Z n = 0) > 0. Thus, for 
every k G N, Pfc(To < oo) = 1 and then Pfe(To < oo) = 1. This completes the first part of 
the lemma. 

We assume now that there exists q > such that max(E(Y i <J ) : i = 0, 1) < oo. 
Moreover E(Y e / 2 ) < oo, so letting q' = min(e/2, q), we have 

E(Y 9 ') < oo. 

We can then apply Proposition [T] (ii) to IBPRE {Z n ) n& ^, so that there exist c, d > such 
that for every n S N, 

Po(?o > n) < ce - * 1 . 

Recalling that for all k, n G N, 

Pfc(Tb > n) < P fc (f > n), 
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we get Po(7o > n) < ce dn . Moreover for every k G N, 

>n)> p(y > k)F k (n > n ). 

By definition of Y, there exists j3 > such that for every nGN, 

P(Y > n) > (3n~ e . 

Using these inequalities gives 

Pfc(T > n) < F k (T >n)< /T^PoC^ > n) < 0- 1 ck e e~ dn . 
This completes the proof. □ 



Proof of Theorem^ (i) and (ii) in the subcritical case: E(log(/'(l))) < 0. We split the 
proof into 4 cases: 

CASE 1: max(E(log+(y,)) : * = 0, 1) < oo. 

CASE 2: There exists q > such that max(E(Y/) : % = 0, 1) < oo. 
CASE 3: E(log+(Yi)) = oo. 
CASE 4: E(log+(Y )) = oo. 

First, note that P(lo = 0) > ensures that Po(-Zi = 0) > and we can use results of 
Section 13.21 

CASE 1. In this case, by Lemma (2j {Z n ) n ^fi is bounded in distribution: 

sup{P (Z n >0}^0. 

neN 

If Eo(7o) = oo, then Z n — > oo in Po by Lemma [T] (ii), which is in contradiction with the 
previous limit. 

Then Eo(To) < oo. We prove now that \/k > 1, Efc(To) < oo by a coupling argument. 
Let k > 1 and change only immigration to get a Markov process {Z n ) n ^ which is larger 
than {Z n ) ne ff. ^ 

Vn G N, Z n > Z n a.s. 

Its immigrations Yq and Y\ satisfy 

Yi = Y u Vn G N, P(y > n) > F(Y > n), 

F(Y >k)>0, max(E(log(li) : i = 0, 1) < oo. 

Then, we have again Eo(To) < oo, which entails that Efc(To) < oo since P(Yo > A;) > 0. 
As for every n G N, Z n > Z n a.s., we have 

E fc (T ) < E fc (fo) < oo. 
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Then Lemma [T] (i) ensures that for every k G N, (Z n ) ng jj converges in distribution to a 
finite random variable Z^, which does not depend on k and verifies F{Z 00 = 0) > 0. 

CASE 2: By Lemma H](i), we have 
J2\Wk(Z n = l)-F{Z oc = l)\<A[ sup {| 

n/2<l<n 

; as) 

Moreover by Lemma [2j for every e > 0, there exists C > such that 

Pfc(T > n) < Cfc £ r n , P (T > n) < Cr n . (14) 

So for every r' G (r, 1), Eo(exp(— log(r)To)) < oo. Then, by Kendall renewal theorem 
|14j . there exists p G (0, 1) and c > such that for every n G N, 

|«n - Wool < cp n . (15) 

Finally, (|14p ensures that there exists D > such that for every n G N, 

E (T ll To>n/4 ) < DnW 4 , 
E fc (T ll To>n/4 ) < DnfeV"/ 4 . 

Combining these two inequalities with (I13p and (|15p . we get 

\^k{Z n = l)~ HZoo = l)\< A[cp n + Dnr n / 4 + Dnk e r n ^] , 

which ends the proof in CASE 2. 

CASE 3. Change immigration of (Z n ) nG ^ to get an IBPRE (Z n ) ne ^ whose immigra- 
tion is distributed as Y\ and whose reproduction law is still given by /. Then Proposition 
[T] (i) and E(log + (Yi)) = oo ensures that {Z n ) n ^ starting from tends in distribution to 
oo. 

Then Lemma [T] (i) entails that Eo(Tq) = oo, so that for every k > 1, 

E fc (7b) > E (f ) = oo, 

since the IBPRE (Zn)neN starting from k > 1 is stochastically larger than (Z n ) n£ ?q 
starting from 0. 

Moreover, under P&, {Z n ) n ^ is equal to (Z n ) ne ^ until time To = To. So Efc(To) = oo. 
Let k > 1 such that P (Zi = fc) > 0, then E (T ) > P (Zi = k)E k (T - 1). This entails 
that 

E (T ) = oo. 

By LemmaQ] (ii), (Z n ) ne ?q starting from any k G N tends to oo in probability. 

CASE 4. Denote by 

X, := F(Z, > | Z f _! = 1, (i > 1), 
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the survival probability in environment and introduce the following random walk 

n 

S n = Y J ^g{X i ). 

1=1 

Then 

Pi(Z n > | (/„, /i, / n _i)) > n?JQ = exp(,S n ) a.s., 

so that 

F k (Z n >0 | (/ ,/i,. ..,/„-i)) = 1-P*(£„ = 0| (/ ,/i,...,/„-i)) 

= 1- [l-Pi(Z n >0 | (/ ,/l,...,/n-l))] fe 

> 1 - [1 - exp(5 n )] fe a.s. 

Thus 

P fc (Zn >0) >E(1- [l-exp(S n )] fc ). 
Using the Markov property we have 



E (T + 1) > ^P(y = fc)E fc (T ) 



fc=i 

oo 



k=l n=l 



oo oo 



> ^P(y = A:)^P fc (^>0) 

fc=l n=l 

oo oo 



fc=l n=l 

Moreover for all x G [0, 1[ and k > 0, exp(/clog(l — x)) < exp(— A;x), and by the law of 
large numbers, S n /n tends a.s. to E(Xi) < so that there exists no > 1 such that for 
every n > tiq, 

F(S n /n > 3E(Xi)/2) > 1/2. 

We get then 

oo oo 

E (T + 1) > ^P(r = £:)^E(l-exp(-£;exp(S n ))) 

k=l n=l 

oo oo 

^ [l-e" 1 ]^^P(A : exp(5 n )>l)P(yo = A:) 

n=l k=l 

oo oo 

> [1 - e- 1 ] nSJn > 3E(Xi)/2) £ P(y = fc) 

n=«0 fc>cxp(-3nE(Xi)/2) 
oo 

> 2 _1 [1 - e" 1 ] P ( y o > exp(-3nE(Xi)/2)) 

n=«o 
oo 

> 2-Hl-e" 1 ] P(/31og(y ) >n), 

n=«o 
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where j3 := [-3E(Xi)/2] _1 > 0. Then E(log(Yb)) = °° ensures that E (T + 1) = oo, so 

E (T ) = oo. 

Conclude that (Z n ) ne ^ tends to oo in using Lemma [U (ii). □ 



6 Asympotics for proportions of cells with a given number 
of parasites 

6.1 Asymptotics without contamination 

Here there is no contamination, i.e. Yq = Y\ = a.s. and we determine when the 
organism recovers, meaning that the number of contaminated cells becomes negligible 
compared to the total number of cells. We get the same result as Theorem 1 in [8] for the 
more general model considered here. Denote by N n the number of contaminated cells. 

Proposition 2. N n /2 n decreases as n grows. 
J/E(log(/' (1))) < 0, then N n /2 n -» a.s. as n -> oo. 

Otherwise, N n /2 n — > as n — > oo iff all parasites die out, which happens with a probability 
less than 1. 

Example. Consider the case of the random binomial repartition of parasites mentioned 
in Introduction. Let Z £ N be a r.v and (P^^j be an i.i.d. sequence distributed 

as a r.v. P £ [0,1], such that P = 1 — P. In every generation, each parasite gives 
birth independently to a random number of parasites distributed as Z. When the cell i 
divides, conditionally on P 1 = p, each parasite of the cell i goes independently in the first 
daughter cell with probability p (or it goes in the second daughter cell, which happens 
with probability 1—p). Then, 

P(/'(l) 6 dx) = F(E(Z)P e dx). 

Thus, the organism recovers a.s. (i.e. N n /2 n tends a.s. to 0) iff 

log(E(Z)) < E(log(l/P)). 

This is the same criteria in the case when the offspring of each parasite goes a.s. is 
the same daughter cell (there, p is the probability that this offspring goes in the first 
daughter cell.) 



Proof. Note that N n /2 n decreases to L as n — > oo, since one infected cell has at most two 
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daughter cells which are infected. Moreover, for every n G N, 

'N n \ E(E ie 6 n ^>o) 



E 



2 n / 2™ 



2 

= £>((a ,...,a„_i) = i)P(Zi>0) 

i€<G„ 

= nz n >o). 

If E(log(/'(l))) < (subcritical or critical case), then f(Z n > 0) tends to as n — > oo 
(see Section [3TTT) . Thus, E(L) = and N n /2 n tends to a.s. asn-> oo. 

If E(log(/'(l))) > (supercritical case), then F(Z n > 0) tends to a positive value, 
which is equal to P(L > 0)0. We complete the proof with the following lemma. □ 

Let us prove the following zero one law, where P n is the total number of parasites in 
generation n. 

Lemma 3. J/E(log(/'(l))) > 0, then 

{ lim N n /2 n > 0} = {Vn G N : P n > 0} a.s. 

Proof. First, we prove that conditionally on non-extinction of parasites, for every K G N, 
there exists a.s. a generation n such that N n > K. Letting K G N, we fix p as the first 
integer such that 2 P > K. Then q := Pi(iV p > K) > since f(N x = 2) > 0. 
Either the number of infected cells in generation p is more than K, which happens with 
probability q, or we can choose in generation p an infected cell i(l), since parasites have 
not died out. Then, with probability larger than q , the number of infected cells in 
generation p of the subtree rooted in this cell i(l) contains more than K parasites. Note 
that this probability is exactly equal to q iff the infected cell i(l) contains one single 
parasite. Recursively, we find a.s. a generation n such than N n > K. 

Then, recalling that we still work conditionally on non-extinction of parasites, the 
stopping time T := inf{n G N : iV n > K} < oo a.s. We now also condition by T = n 
and Nt = k. We can then choose one parasite in every infected cell in generation n, 

(i) 

which we label by 1 < i < k and we denote by Np the number of cells in generation 
n + p infected by parasites whose ancestor in generation n is the parasite i. By branching 
property, the integers {N p {i) : 1 < i < k) are i.i.d. and Np /2 P — ► LW as p — > oo, where 
(L® : 1 < i < k) are independent and P(L (i) > 0) = P(L > 0) > for every 1 < i < k. 
Using that 



N n+P >J2NP a.s., 



i=i 



and as k > K, we get 

lim N n+P /2 P > max(L (i) : 1 < i < K) a.s. 



As sup(L^) : i G N) = oo a.s., letting K — > oo ensures that a.s. N P /2 P does not tend to 
0. □ 
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6.2 Asymptotics with contamination in the case E(log(/'(l))) < and 
max(E(log + (Yi)) : % = 0, 1) < oo. 

Define Fk(n) the proportion of cells with k parasites in generation n: 

We introduce the Banach space Z*(N) and the subset of frequencies 5'5' 1 (N) which we 
endow with the norm || . ||i defined by: 

oo oo 

^(N) := {(xj) ie N : ^2 < I' ( x *)*eN ||i= ^ \xi\, 

i=0 i=0 

oo 

SS 1 ^) := {(fi) im :V ! GN,/ l eM + ,^/ i = 1}. 

The main argument here is the law of large number proved by Guyon [12] for asymmetric 
Markov chains indexed by a tree. 



Theorem 2. 7/E(log(/'(l))) < and max(E(log + (y;)) : % = 0, 1) < oo, then {F k (n)) keN 
converges in probability in SS 1 (N) to a deterministic sequence (fk)keN as n ~~ * °°> such 
that fo > and J2kLo fk = !• Moreover, for every k G N, f k = P(^oo = k). 

Proof. Recall that (Z^^j is a Markov chain indexed by a tree and we are in the framework 
of bifurcating Markov chain studied in |12j . Thanks to the ergodicity of the number of 
parasites in a random cell line proved in the previous section (Theorem [T] (i)), we can 
directly apply Theorem 8 in [12] to get the convergence of proportions of cells with a 
given number of parasites. □ 



But it seems that we can't apply Theorem 14 or Corollary 15 in [12] to get a.s. 
convergence of proportions, because of the term k € in estimation of Theorem [TJ For 
examples, we refer to the previous proposition. 



Using again [12], we can prove also a law of large numbers and a central limit theorem 
for the proportions of cells with given number of parasites before generation n. Define, 
for every n G N, 

Pk{n) . = #{ I6 ^.G,:3 = *} (keN) 

Theorem 3. 7/E(log(/'(l))) < and max(E(log+(Ti)) : i = 0, 1) < oo, then (P fc (n)) fceN 
converges in probability in S'S' 1 (N) to the deterministic sequence (fk)keN as n ^ oo. 
Moreover for every k G N, y / n(Pfc(n) — f k ) converges in distribution to a centered normal 
law as n — > oo, with a non explicit variance. 
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Proof. Use again Theorem [T] (i) and Theorem 8 in [12] to prove the law of large numbers. 
For the central limit theorem, use Theorem 19 in [12] by letting F be the set of continuous 
functions taking values in [0, 1]. □ 



6.3 Asymptotics with contamination in the case E(log(/'(l))) > or 
max(E(log + (Yi)) : % = 0, 1) = oo. 

In this case, cells become infinitely infected as the generation tends to infinity. 

Theorem 4. If E(log(/'(l))) > or max(E(log + (Tj)) : i = 0, 1) = oo, for every k E N, 
then Fk(n) tends to zero as n — > oo. That is, for very K E N, 

lim #{i E G n : Z x > K}/2 n = 1. 

n— >oo 

Proof. By Fubini's theorem, we have 

E[#{i E G n : Z, > K}/2 n ] = ^ P(Z X > K)/2 n 

= ^ P((ao, ...a n -i) = i)P(^ > 

iG<G„ 

= P(Z n >i^). 

By Theorem [TJ P(Z n > tends to 1, then 1 — #{i £ G„ : Z, > iC}/2 n converges to in 
L j which gives the result. □ 



7 Asymptotics for the number of parasites 

We assume here that parasites multiply following a Galton Watson process with de- 
terministic mean m, independently of the cell they belong to. That is, s *— > f(s, s) is 
deterministic and every parasite multiply independently with the reproduction law whose 
p.g.f. is equal to g : s i— > f(s, s). Moreover we assume that contamination of a cell does 
not depend on the number of enclosed parasites. That is 

Y = Y = Y 1 . 

Set P n the number of parasites in generation n. Without contamination, in the 
supercritical case m > 1, it is well know that either P n becomes extinct or P n /m n 
converges to a positive finite random variable. In the presence of contamination, we have 
the following result. 

Proposition 3. If K(Y) < oo and P(lb > 0) > 0, then log(P n )/n converges in P to 
log(max(2, to)). 
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Proof. First, we prove the lower bound. This is a consequence of the fact that P n is larger 
than 

(i) the total number of parasites P^ which contaminate cells of generation n, 

(ii) the number of parasites P n (p) in generation n with the same given parasite ancestor 
in generation p. 

Indeed, first P^ is the sum of 2 n i.i.d. random variables with mean E(Y), so law of 
large numbers ensures that 

Plj2 n ™ E(Y) > in P 
Then, since P n > P* a.s. for every n E N, 

P n oo in P. (16) 

Moreover for every p < n, 

P n (p)/m n - p ™ W, a.s., (17) 

with W(W > 0) > 0. Let now Pi be the sum of the number of descendants in generation 
n of each parasite of generation p. We get then the sum of P p i.i.d. quantities distributed 
as P n {p) . Then (fT6|) and (fT7|) ensure that we can choose p such that 

Pl/m n - pn ^?W a.s. 

with F(W > 0) > 1 - e. 

Using that N n is larger than P^ and P% ensures that for every e > 0, 

limsupP(log(P n )/n < log(max(2, m))) < e. 

n^oo 

Letting e — > gives the lower bound. 

Second, we prove the upper bound. Note that the total number of parasites in gener- 
ation n can be written as 

i=i j=i k=i 

where Y % ^ is the number of parasites which contaminate the jth cell of generation i, and 
labeling by 1 < k < Y l > 3 these parasites, Z 1 ^ is the number of descendants in generation 
n of the A:th parasites. 

Moreover (Y l,J : i G N,j S N) are identically distributed and independent of (Zp J (k),i E 
N,j E N, ken), (Z% j ,i E N,j E N, k E N) are independent and Z l p ' J (k) is the 
population of a Galton Watson process in generation n — i with offspring probability 
generation function equal to g. Thus 



J k , 



n 2 1 

i=i j=i k=i 

n 2' 

= ^^E(Y^)E(Z^') 
i=i j=i 

n 2 1 

= E(Y)]TJ>— 

8=1 j=l 
n _ nn 

= 2E(Y) ifm/2. 
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If m = 2, then E(P„) = E(y)nm n . This gives the upper bound by Markov inequality 
and completes the proof. □ 
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